SR-tree: An index structure for nearest-neighbor searching of high-dimensional point data

The spatial index structure is one of the most important research topics for organizing and managing massive 3D Point Cloud. As a point in Point Cloud consists of Cartesian coordinates (x,y,z), the common method to explore geometric information and features is nearest neighbor searching. An efficient spatial indexing structure directly affects the speed of the nearest neighbor search. Octree and kd-tree are the most used for Point Cloud data. However, Octree or KD-tree do not perform best in nearest neighbor searching. A highly balanced tree, 3D R*-tree is considered the most effective method so far. So, a hybrid spatial indexing structure is proposed based on Octree and 3D R*-tree. In this paper, we discussed how thresholds influence the performance of nearest neighbor searching and constructing the tree. Finally, an adaptive way method adopted to set thresholds. Furthermore, we obtained a better performance in tree construction and nearest neighbor searching than Octree and 3D R*-tree.

Download Full-text

Angle-Tree: a new index structure for high-dimensional point data

10.1117/12.451114 ◽

2001 ◽

Author(s):

Daoguo Dong ◽

Xiangyang Xue ◽

Hangzai Luo ◽

Yingqiang Lin

Keyword(s):

Index Structure ◽

High Dimensional ◽

Point Data

Download Full-text

High dimensional nearest neighbor searching

Information Systems ◽

10.1016/j.is.2005.01.001 ◽

2006 ◽

Vol 31 (6) ◽

pp. 512-540 ◽

Cited By ~ 16

Author(s):

Hakan Ferhatosmanoglu ◽

Ertem Tuncel ◽

Divyakant Agrawal ◽

Amr El Abbadi

Keyword(s):

Nearest Neighbor ◽

High Dimensional ◽

Nearest Neighbor Searching

Download Full-text

HD-Tree: An Efficient High-Dimensional Virtual Index Structure Using a Half Decomposition Strategy

Algorithms ◽

10.3390/a13120338 ◽

2020 ◽

Vol 13 (12) ◽

pp. 338

Author(s):

Ting Huang ◽

Zhengping Weng ◽

Gang Liu ◽

Zhenwen He

Keyword(s):

Hash Table ◽

Index Structure ◽

High Dimensional ◽

Sequential Search ◽

Storage Space ◽

Hash Tables ◽

Time Performance ◽

Indexing Method ◽

Point Data ◽

Better Than

To manage multidimensional point data more efficiently, this paper presents an improvement, called HD-tree, of a previous indexing method, called D-tree. Both structures combine quadtree-like partitioning (using integer shift operations without storing internal nodes, but only leaves) and hash tables (for searching for the nodes stored). However, the HD-tree follows a brand-new decomposition strategy, which is called half decomposition strategy. This improvement avoids the generation of nodes containing only a small amount of data and the sequential search of the hash table, so that it can save storage space while having faster I/O and better time performance when building the tree and querying data. The results demonstrate convincingly that the time and space performance of HD-tree is better than that of D-tree regardless of uniform or uneven data, which are less affected by data distribution.

Download Full-text

SPY-TEC+ : AN INTEGRATED INDEX STRUCTURE FOR k-NEAREST NEIGHBOR QUERIES WITH SEMANTIC PREDICATES IN MULTIMEDIA DATABASE

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194011005529 ◽

2011 ◽

Vol 21 (07) ◽

pp. 989-1011

Author(s):

DONG-JOO PARK ◽

DONG-HO LEE

Keyword(s):

Visual Information ◽

Nearest Neighbor ◽

High Dimensional Data ◽

Multimedia Retrieval ◽

Index Structure ◽

High Dimensional ◽

K Nearest Neighbor ◽

Nearest Neighbor Algorithm ◽

Integrated Index ◽

Nearest Neighbor Queries

Recently, advanced multimedia applications, such as geographic information system, and content-based multimedia retrieval system, require the efficient processing of k-nearest neighbor queries over large collection of multimedia objects. These queries usually include the semantic information that is represented by text, as well as the visual information that is represented by a high-dimensional feature vector. Among the available techniques for processing such queries, the incremental nearest neighbor algorithm proposed by Hjaltason and Samet is known as the best choice. However, the R-tree used in their algorithm has no facility capable of partially pruning the candidate tuples that will turn out not to satisfy the semantic predicate. Also, the R-tree does not perform sufficiently well on high-dimensional data even though it provides good results on low or middle-dimensional data. These drawbacks may lead to a poor performance when processing the query. In this paper, we propose an integrated index structure, so-called SPY-TEC+, that provides an efficient method for indexing the visual and semantic feature at the same time using the SPY-TEC that was proposed for indexing high-dimensional data, and the signature file. We also propose an efficient incremental nearest neighbor algorithm for processing k-nearest neighbor queries with visual and semantic predicates on the SPY-TEC+. Finally, we show that the SPY-TEC+ enhances the performance of the SPY-TEC for processing k-nearest neighbor queries with visual and semantic predicates through various experiments.

Download Full-text

FASTER SIMILARITY SEARCH FOR MULTIMEDIA DATA VIA QUERY TRANSFORMATIONS

International Journal of Image and Graphics ◽

10.1142/s0219467803000890 ◽

2003 ◽

Vol 03 (01) ◽

pp. 3-29

Author(s):

CHRISTIAN A. LANG ◽

AMBUJ K. SINGH

Keyword(s):

Similarity Search ◽

Nearest Neighbor ◽

Estimation Error ◽

Fractal Dimensionality ◽

Multimedia Data ◽

Index Structure ◽

High Dimensional ◽

Range Queries ◽

Query Execution ◽

Real World Data

The performance of nearest neighbor (NN) queries degrades noticeably with increasing dimensionality of the data due to reduced selectivity of high-dimensional data and an increased number of seek operations during NN-query execution. If the NN-radii would be known in advance, the disk accesses could be reordered such that seek operations are minimized. We therefore propose a new way of estimating the NN-radius based on the fractal dimensionality and sampling. It is applicable to any page-based index structure. We show that the estimation error is considerably lower than for previous approaches. In the second part of the paper, we present two applications of this technique. We show how the radius estimations can be used to transform k-NN queries into at most two range queries, and how it can be used to reduce the number of page reads during all-NN queries. In both cases, we observe significant speedups over traditional techniques for synthetic and real-world data.

Download Full-text

VA-Trie: A New and Efficient High Dimensional Index Structure for Approximate k Nearest Neighbor Query

Journal of Computer Research and Development ◽

10.1360/crad20051227 ◽

2005 ◽

Vol 42 (12) ◽

pp. 2213 ◽

Cited By ~ 2

Author(s):

Daoguo Dong

Keyword(s):

Nearest Neighbor ◽

Index Structure ◽

High Dimensional ◽

K Nearest Neighbor ◽

Nearest Neighbor Query

Download Full-text

Composite Distance Transformation for Indexing and k-Nearest-Neighbor Searching in High-Dimensional Spaces

Journal of Computer Science and Technology ◽

10.1007/s11390-007-9027-5 ◽

2007 ◽

Vol 22 (2) ◽

pp. 208-217 ◽

Cited By ~ 2

Author(s):

Yi Zhuang ◽

Yue-Ting Zhuang ◽

Fei Wu

Keyword(s):

Nearest Neighbor ◽

High Dimensional ◽

K Nearest Neighbor ◽

Distance Transformation ◽

Nearest Neighbor Searching

Download Full-text

PK-Tree: A Spatial Index Structure for High Dimensional Point Data

Information Organization and Databases ◽

10.1007/978-1-4615-1379-7_20 ◽

2000 ◽

pp. 281-293 ◽

Cited By ~ 2

Author(s):

Wei Wang ◽

Jiong Yang ◽

Richard Muntz

Keyword(s):

Spatial Index ◽

Index Structure ◽

High Dimensional ◽

Point Data

Download Full-text

Tropical Balls and Its Applications to K Nearest Neighbor over the Space of Phylogenetic Trees

Mathematics ◽

10.3390/math9070779 ◽

2021 ◽

Vol 9 (7) ◽

pp. 779

Author(s):

Ruriko Yoshida

Keyword(s):

Supervised Learning ◽

Phylogenetic Trees ◽

Nearest Neighbor ◽

Nearest Neighbors ◽

High Dimensional ◽

Learning Method ◽

Dimensional Vector ◽

K Nearest Neighbor ◽

K Nearest Neighbors

A tropical ball is a ball defined by the tropical metric over the tropical projective torus. In this paper we show several properties of tropical balls over the tropical projective torus and also over the space of phylogenetic trees with a given set of leaf labels. Then we discuss its application to the K nearest neighbors (KNN) algorithm, a supervised learning method used to classify a high-dimensional vector into given categories by looking at a ball centered at the vector, which contains K vectors in the space.

Download Full-text